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The problem to express a natural number N as a product of natural numbers without regard to or- 
der corresponds to a thermally isolated non-interacting Bose gas in a one-dimensional potential with 
logarithmic energy eigenvalues. This correspondence is used for characterising the probability distri- 
bution which governs the number of factors in a randomly selected factorisation of an asymptotically 
large N. Asymptotic upper bounds on both the skewness and the excess of this distribution, and on 
the total number of factorisations, are conjectured. The asymptotic formulas are checked against 
exact numerical data obtained with the help of recursion relations. It is also demonstrated that for 
large numbers which are the product of different primes the probability distribution approaches a 
Gaussian, while identical prime factors give rise to non-Gaussian statistics. 

PACS numbers: 05.30.Ch, 05.30.Jp, 02.30.Mv 



I. INTRODUCTION 



Each integer number N can be written in a unique way as the product of prime numbers Pi with integer multiplicities 

Tli, 

ni 

N = ]J P ?. (1) 

i=l 

Every other possibility to decompose N into integer factors larger than 1 is obtained by multiplying out some of its 
prime factors. For N = 30, for instance, we have the factorisations 

30 = 2-3-5 
= 5-6 
= 3-10 
= 2-15 

= 30 . (2) 

It is understood that the ordering of the factors does not matter here, so that 5 • 6 is not distinguished from 6 • 5. 
Let $>(N, k) denote the number of such factorisations of N which contain exactly k factors, and let £l(N) be the total 
number of factorisations. Then, according to the above list, $(30,1) = 1, $(30,2) = 3, and $(30,3) = 1, giving a 
total of SI (30) = 5 factorisations. 

It is obvious that the number £l(N) of possible factorisations of N does not only depend on the total number of 
prime factors of N, but also on the multiplicities with which the different prime factors occur: Taking N = 12, we 
have 

12 = 2-2-3 
= 2-6 
= 3-4 

= 12, (3) 

so that $(12,1) = 1, $(12,2) = 2, and $(12,3) = 1, adding up to £1(12) = 4. Since two of the three prime factors 
of N = 12 are equal, the number of different combinations of prime factors is less than for N = 30, which possesses 
three different prime factors. 

We therefore divide the natural numbers into equivalence classes: Two numbers N\ and N2 are said to be equivalent 
if they give rise to the same pattern of factorisations, meaning that $(iVi,fe) = $>(N2,k) for all k. Particular equi- 
valence classes consist of numbers which are some power of a single prime p: If N = p m , or 



In N = m lnp 



(4) 



2 



for some integer m, the task of factorising N is equivalent to the task of partitioning the exponent m into integer 
summands, i.e., to the famous number-partitioning problem partitio numerorum introduced by Euler 1 . This problem, 
which in itself plays a significant role in several areas of modern mathematics^**^, is connected to a number of topics 
occuring in statistical physics, ranging from lattice animals^ and combinatorial optimisation^ over Fermion-Boson 
transmutation^ to quantum entropy and energy currents flowing in a one-dimensional channel connecting thermal 
reservoirs** 

In the general case, taking the logarithm of the prime factor decomposition Q yields 



In N = ^2<ni In p t . (5) 



Seen from the viewpoint of statistical physics, this latter equation allows for an interesting interpretation: If we 
consider an ideal Bose gas consisting of sufficiently many particles, confined such that the single-particle energies e v , 
when suitably made dimensionless, are given by the logarithms of the primes, e v = h\p v with p v = 1, 2, 3, 5, 7, 11, . . . for 
v = 0, 1, 2, 3, 4, 5, . . . , the equation J5| indicates one particular microstate of the system, where the total energy In TV 
is distributed among the particles in such a way that is the occupation number of the energy level £j. Additional 
particles which are not excited remain in the ground state, forming a Bose-Einstein condensate. Such surplus particles 
do not contribute to the energy, since £o = lnl = 0. Ideal Bose gases with a single-particle spectrum corresponding 
to the sequence of the logarithms of the prime numbers have recently been studied by Tran and BhaduriiS,, with 
particular emphasis placed on differences between the fluctuation of the number of condensate particles in different 
statistical ensembles. 

A few remarks concerning low-dimensional, ideal Bose-Einstein condensates might appear in order. The example 
of an ideal Bose gas of M particles in a one-dimensional harmonic potential with oscillator frequency u captures the 
essentials: If one considers the usual thermodynamic limit, meaning M — > oo and u> — ► such that the product Mw 
remains constant, the Bose-Einstein condensation temperature approaches zeroii. However, this is not the relevant 
limit here; we rather have to consider the limit of large particle numbers at constant oscillator frequency. In this 
case there still exists a nonzero characteristic temperature Tq for the onset of a macroscopically large occupation of 
the ground state, roughly given by Tq = (Huj /kg)M / ln(M), which approaches infinity for M — > ooA2*i*V Thus, there 
exists a condensate for any finite temperature, if the particle number is sufficiently large. 

Since we wish to treat all possible factorisations (not only those into primes) of a given number N without regard 
to the order of the factors, the Bose-gas analogy requires the single-particle spectrum 

e v =ln(i/+l) i/ = 0,1,2,... . (6) 

We point out that such a spectrum might actually be realisable: As shown in the appendix^ within the quasi-classical 
approximation the eigenvalues of a particle in a one-dimensional logarithmic potential V(x) — Vq ln(|x|/L) are given 
by Vo ln(2^ + 1), up to a constant; the restriction to odd numbers 2v + 1 is not essential. 

The key point here is that the analogy between the factorisation problem and an ideal Bose gas with logarithmic 
single-particle spectrum allows us to invoke well-established methods from statistical physics for obtaining information 
on number-theoretical properties of large composite integers. In this paper, we focus on the probability distribution 

, x $(N,k) 

for given (asymptotically) large integers N , i.e., on the probability of finding k factors in a randomly selected factori- 
sation of a large N. We will proceed as follows: In section^ we state recursion relations required for the numerical 
evaluation of the exact quantities Q(N, k), deferring the derivation to appendix iBl We then briefly explain in sec- 
tion II I II the method used to obtain, by means of a detour from the microcanonical to the canonical ensemble and 
back, asymptotic expressions for the cumulants of the distributions Q. In the following, we focus on the two extreme 
kinds of equivalence classes, namely those made up of powers of a single prime on the one hand, and those consist- 
ing of products of different primes on the other, and state the respective asymptotic formulas for the cumulants in 
sections IlIII and llVl In this way, an interesting feature will become apparent: While the presence of identical prime 
factors in the first case introduces Bose-like correlations which prevent the distributions J7J from becoming Gaussian 
even in the asymptotic limit, large products of different primes do indeed lead to almost Gaussian distributions. The 
paper closes with a brief summary in section Ivl 

In passing, we point out that the problem considered in this paper should be clearly distinguished from a similar 
problem known as factorisatio numerorum, first investigated by Kalmar 14 . In this latter connection one counts all 
ordered sequences (m, ri2, ■ ■ ■ , Uk) of integers m, ri2, • ■ ■ , > 2 which yield N when multiplied, ni • n% ■ ■ ■ ■ • Hfe = N. 
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FIG. 1: The number of possibilities Q(N) to factorise an integer N into products of natural numbers, for 1 < N < 10 6 . For 
each N, the exact value £l(N) has been computed with the help of the recursion relation (j5J. While there are lots of numbers 
which are products of two primes, giving Q(N) — 2, only relatively few numbers can be written as the third power of a prime, 



implying ft(N) = 3. Composite numbers like 17280 
high values of Q(N), indicated by crosses. 



2 7 ■ 3 3 • 5 , 120960 = 2 7 • 3 3 ■ 5 • 7 , or 725760 = 2 s ■ 3 4 • 5 ■ 7 yield rather 



Denoting the number of such ordered sequences by ajv (with a± = 1), one deduces X)jv>i a ^-^ s = [2 — C( s )] 
where £(s) is Riemann's zeta function; the task then is to find the asymptotic behaviour of the sum function 



A(x) 



E 

KJV<i 



UN 



(8) 



Recent results, and further information on this problem, have been collected in ref^. In contrast, in the present 
paper we do not count different orderings of factors as different configurations, or microstates. It is precisely this 
identification of different orderings which leads to the Bose-gas analogy (with Bosons, the question "which particle 
occupies which state" is meaningless), and thus opens up the avenue followed here. 



II. RECURSION RELATIONS 



In order to determine the exact distributions J7J), we employ a recursion relation: Let Tk(N) be the number of 
factorisations of N into k or less factors. Then, as shown in appendix [5] one has 



T k (N) 



with 



1 k 
£?E 



N 

E 

u = l 



IV„ (N/u n ) 



(9) 



r fe (l) = l and r o (A r >l)=0. 



Contributions to the second sum in equation © arise only when v n divides N. Starting from Ti(N) = 1, the sequence 
Tk(N) increases with increasing k (unless N happens to be a prime) until it reaches its final value Q(N), since N 
cannot be expressed as a product of more than log 2 (iV) integer factors greater than 1: 



Q(N)=T k (N), fc>log 2 (iV) 



(10) 



Figure shows a logarithmic plot of Cl(N) for 1 < N < 10 6 . Each data point [N, &(N)] has been marked by an 
individual dot; the equivalence classes clearly manifest themselves as horizontal lines. While these data might suggest 
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FIG. 2: Exact skewness 71 (TV), as defined in equation f° r the probability distributions 10, with 1 < TV < 10 6 . There are 
some numbers TV for which the distribution is symmetric, so that 71 (TV) = 0. In contrast, numbers which are integer powers 
of 2 give rise to a particularly large skewness, as indicated by the crosses. 



an upper bound on Q(TV) on the order of TV 77 , the reader should be warned that the true asymptocis are not reached 
for TV as small as 10 6 ; a correct upper bound will be stated later. 

The number of possibilities to factorise TV into exactly k factors is now given by 



$(TV,A;) = r fe (TV)-r fc _ 1 (TV) , 



(ii) 



so that we have access to the probability distribution Q . As in previous investigations of ground-state fluctuations 
of non-interacting and weakly interacting Bose gasesi^iLi^ it is useful to characterise such a distribution in terms 
of its cumulants /c^US, which directly quantify its deviation from a Gaussian: For a Gaussian distribution, «W = 
for £ > 2. The lowest four cumulants are related to the mean value (k) and the central moments fiS^ = ((k — (k)) e ) 
through the relations^ 



(1) 


= (*>, 


(2) 


= M (2) , 


(3) 


= M (3) , 


(4) 


= M (4) - 



Normalising the third and the fourth cumulant we respect to the second, one obtains the skewness 



71 



,-(2) 



3/2 



which characterises the asymmetry of the underlying probability distribution, and the excess 



72 



2 ' 



(12) 



(13) 



(14) 



which characterises its flatness. 

In figure [21 we display exact data for the skewness 71 (TV) of the distributions , revealing that their asymmetry 
becomes particularly large when TV = 2 m . It is clear that all numbers within the same equivalence class yield the 
same cumulants, and thus the same values of 71 and 72. 
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Since the general algorithm requires rather a large amount of computer memory, it pays to focus on particular 
equivalence classes: For numbers N which are powers of a prime p, 

N=p m , (15) 

one has 

min{m — k,k} 

$(p m ,k) = Yl *(p m ~ k ,v); (is) 

this recursion relation still has to be solved numerically If, on the other hand, N is a product of distinct primes, 

m 

N = Y[pi, Pi^ Pj for i j , (17) 

i=l 

the probability distribution is given by the relation 

* (n?=i % . k ) = $ (n™ - ! 1 Pi,k-i)+ (n;^ 1 Pj , k) m 

with &(pi, 1) = 1 and ^(Oi^i ft > m ) = 0- As explained in appendix IO this relation even admits a closed solution: 



Kn^ft^h^E^V- as) 

III. ASYMPTOTIC FORMULAS FOR FACTORISING POWERS OF A PRIME 

The problem of finding all factorisations of a given number N corresponds, within the Bose-gas analogy, to a 
problem of microcanonical statistics: Given the total energy In AT, the task is to find all possible microstates, i.e., all 
possibilities for distributing the energy over the available single-particle levels £„. Particles which are not excited and 
thus remain in the condensate contribute with lneo = to the energy, or with a factor of 1 to the product. Hence, the 
picture to have in mind is that of a partially condensed Bose gas, with the excited particles corresponding to factors 
larger than 1, and a sufficiently large supply of condensed particles in the ground state. 

While the microcanonical requirement to keep the total energy constant introduces severe technical difficulties, the 
statistics of partially condensed ideal Bose gases become much simpler in the canonical ensemble, where the system 
is kept in contact with a thermal reservoir of temperature T . It is precisely at this point that the Bose-gas picture 
becomes essential: One can obtain information about the microcanonical number factorisation problem by invoking 
the notion of temperature. 

We employ the first energy gap E\ — Sq to introduce the dimensionless inverse temperature 

&=(£!- e )/3 , (20) 

where (3 — X/{k^T), as usual. A decisive role for the canonical statistics of a partially condensed, ideal Bose gas 
is played by the grand canonical partition function S cx (6, z) of an auxiliary system from which the single-particle 
ground state Eq has been removed, while all other levels e„ remain unaltered. If there is an infinite supply of 
condensed particles, in the spirit of the so-called Maxwell's demon ensemble^, this function has the exact integral 
representation^ 1 ^ 



1 



t-Hoc 



lnS cx (6,z) = ^- / dtb-trWnWgt+iiz), (21) 



T - — lOO 



where 



6 



denotes the familiar Bose functions^, and 



i/=i 



£\ — £o 
£v — £o 



(23) 



is a scries determined by the single-particle spectrum. The real number r > in equation (|21l) is chosen such that 
all poles of the integrand lie on the left hand side of the path of integration. The value of this representation i|21[) . 
and of the subsequent representation (|25[1 . lies in the fact that it lends itself to the derivation of an asymptotic series 
which captures the low-6-behaviour, if one collects, from right to left on the real axis, the residues of the respective 
integrands. With respect to an actual Bose gas, the underlying assumption of an infinite number of ground-state 
particles restricts the validity of equation Ij21|l to temperatures low enough to guarantee the existence of a condensate. 
In the case of the number factorisation problem there is no similar restriction, owing to the fact that multiplying any 
product by an arbitrary amount of factors of unity does not change its value: In the number-theoretic context the 
representation l|21|) is exact. 

The £-th cumulants Ken (b) of the canonical distribution governing the number of excited particles in the gas are 
then immediately obtained by differentiation, 



d 



«W(6)= [z- ) lnS ex (M 



(24) 



giving 



1 
2~7ri 



dt b~ l r(t) r)(t) C(t + l- £) 



(25) 



where C,(t) — n ~ t ^ s the Riemann zeta function. We will only consider non-interacting Bosons in trapping 

potentials with single particle energy levels e v (v — 0, 1, 2 . . .) such that the series ()2.'{l) converges for t > to, with some 
real to > 0. 

The strategy of employing the canonical ensemble for solving a microcanonical problem hinges on the possibility to 
get rid of the temperature in a second step, and to find an expression for the microcanonical cumulants. This is done 
with the help of the saddle-point method^: The "energy-temperature" relation reads 



d 

n + 1 = - — \n^ cx (b, z) 
ob 



(26) 



b=b (z) 



where n denotes the energy in units of E\ — Eo, and bo{z) is the saddle point. The generating function of the 
microcanonical cumulants then takes the form24 



\nY(n,z) = In z, cx (bo(z), z) + nbo(z) — — In 2ir 
— In | — In S rx (6, z 

and the desired microcanonical cumulants are finally calculated by taking derivatives, 



(27) 



(n) 



z— In Y (n, z) 
dz 



(28) 



The method sketched here requires knowledge about the analytical properties of the function rj(t) introduced in 
equation l|23|) . For applications to the number factorisation problem, the single-particle energies are determined by 
the equivalence class of the number N: In the simplest case (|15fl . the possible single-particle energies are integer 
multiples of lnp, so that rj(t) coincides with the Riemann zeta function C(^)> & n d the factorisation of N is equivalent 
to the partition of m = log p iV into summands. The evaluation of the above formalism for the partition problem 
has already been discussed in detail^, so we merely quote the results: For numbers of the type l|15l) . the asymptotic 
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FIG. 3: Skewness (JTHJ for large numbers N = 2 m and TV = (10 1000 +453) m , as obtained from the recursion relation JTfjJ (dots). 
The dashes correspond to the evaluation of the asymptotic formula 12911 for TV = 2 m ; the horizontal line indicates the maximum 
value 71, max ~ 1.1907. 



formula for the skewness (|13[1 reads 

12V6C(3) 



7i 



+ 



+ 



1 



1 



0.10128 [ln(log p (TV))] 2 - 0.37376 ln(log p (JV)) - 1.7078 



log P (A0 



0.0075008 [ln(log p (TV))] +0.025681 [ln(log p (TV))] ' 



+ 0.020024 [ln(log p (TV))]' 
+ G(log p (TV)- 3 / 2 ) , 
while the excess l|14[l takes the form 



0.23028 ln(log p (TV)) - 0.56984 



72 



= 2.4 



^log P (iV) 



0.28440 [ln(log p (TV))] 2 - 0.56714 ln(log p (TV)) - 10.064 



+ 



log p (TV) 



0.025276 [ln(log p (TV))] 4 + 0.022329 [ln(log p (TV))] 



(29) 



- 0.33809 [ln(log p (TV))] + 0.73538 ln(log p (TV)) + 3.7863 

+ 0(log p (TV)- 3 / 2 ) . (30) 

In order to demonstrate the accuracy of these formulas, we compare in figures [3] and 0] exact values for the skewness 
and the excess, corresponding to large numbers TV = p m , with the predictions of the equations l|29|l and (|30|l . The 
agreement is excellent. It is also visible that both 71 (TV) and 72 (TV) approach constant values only for numbers TV of 

the order > . The finding that the limiting values of skewness and excess are nonzero expresses the fact that for 
numbers of the form l|15|) the distribution Q remains non-Gaussian even for asymptotically large TV. 

The exact numerical data further reveal that both skewness and excess reach their limiting values from above; the 
skewness adopts its maximum value 

7l , max ~ 1.1907 (31) 
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FIG. 4: Excess O for l ar S e numbers TV = 2™ and TV = (10 1000 + 453) m , as obtained from the recursion relation 116H (dots). 
The dashes correspond to the evaluation of the asymptotic formula I3LIH for TV = 2 m ; the horizontal line indicates the maximum 
Value 72, max — 2.5120. 



for TV = p . The maximum of the excess, 



72 , max ~ 2.5120 



(32) 



lies at TV = p 



5507 



IV. ASYMPTOTIC FORMULAS FOR FACTORISING PRODUCTS OF DIFFERENT PRIMES 

We now turn to the equivalence classes consisting of products of different primes, and restrict ourselves to numbers TV 
which are the product of the first m primes, 



N = Y[ Pi , 

where m is large. In this case, the single-particle energies accessible to the Bose gas are given by 

m 

£ Wi=i...m =y^vj in fe) > u * e {°> !} • 



(33) 



(34) 



The restriction Vi € {0, 1} stems from the fact that each prime pi does either contribute to a given factor of TV, or it 
does not. The task then is to characterise the analytical properties of the series 77(f) corresponding to this spectrum. 
We circumvent this severe difficulty by considering a simpler, but combinatorically equivalent problem: We replace 
the actual energy levels (|34|l by 



€{0,1} 



(35) 



where a > 1 is a transcendental number which has to assure the "prime" character of exp(a m ) in the sense that it 
be not possible to express a m as a sum of the form X)I=i n i a% i f° r arbitrary integers rij, since a prime cannot be 
represented as a product of arbitrary powers of lower primes, and any m, since the function 77(f) provided by the 
substitute l|35|l acquires a contribution for each m. This requirement forces us to take the limit a — > 1 in the end: 
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Otherwise, there would be an infinite sequence of integers m for which the forbidden equality a m = n i a% could 

be satisfied with arbitrary accuracy. It is also clear that the total energy Y^iiLi m (Pi) nas to be replaced by 

m 

n = J2 a '~ X ■ ( 36 ) 

i=l 

With this surrogate, we now have 

oo 1 

»»(*)=£ E 7 \t ' (37) 



and easily obtain upper and lower bounds on the second sum, assuming t > 0: 



E t — ^ , n; X7 >y— tt ■ (38) 



(a m )* 



Thus, for f > J°r^ we can perform the sum over m and obtain 



implying that 77(f) has a simple pole at 

W2) 
' ~ ln(a) 

with a residuum r which can at least be estimated, 



(40) 



ln(2) 

— > r > Mr-* . (41) 

In a In a 

These informations suffice for an approximate evaluation of the integral representation (|25|) of the canonical cumulants. 
We are interested in their asymptotic behaviour for small b, as defined in equation (|20|l : While the temperature has 
to be low enough to guarantee the existence of a Bose condensate, UbT =1/(3 has to remain large in comparison with 
£1 — eq, so that sufficiently many states above the ground state remain populated. Now the small-6-asymptotics of 
the cumulants (|25[) are determined by the rightmost pole of the respective integrand. Since T(t) has simple poles for 
t = 0, —1, —2, . . . , and ((t + 1 — £ ) has a simple pole at t = £, the dominant pole is provided by 77(f) at f = c : Taking 
a close to unity, as required, results in a value of c larger than any fixed, finite number. 
Restricting ourselves to this dominant pole, the residue theorem then yields 

^W~ r(cK(C + 1 -° r , 1 = 0,1,2,..., (42) 
with the ~-sign indicating asymptotic equality. It follows immediately that 

Um ^" (6) e = , I > 2 , (43) 

6 - 1 

which means that the canonical distribution becomes Gaussian-like for small temperatures. 

Next, one needs the energy-temperature relation i|26[l in order to return to the microcanonical ensemble. Since, 

according to equation (|24|) . lnS 0X (6, 1) coincides with K^(b), to leading order this relation takes the form 

H „).(«±1)I)*, (44) 
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and the leading-order term of the microcanonical cumulants i|28|) is given by 



mcl ' [b(n)] c ■ [ ' 



Utilising 

r(c + i)V(«H-i) 



exp(-l) (46) 



for c — > oo, we then find 

«W(n)~exp(-l)(n+l), £>1. (47) 



' '.(2) 



As an immediate consequence, we have 



^mc (w) 

so that the approach to a Gaussian is also met in the microcanonical ensemble. 

We now have to get rid of the auxiliary energy-like quantity n, and to re-introduce the number to be factorised, N. 
By virtue of the definition l|36|l one has n ~ m for a — ► 1, meaning that the "energy" approaches the number of prime 
factors of N when N becomes large. Hence, we find 

71 ~ 7 < lu rfj (49) 

Vexp(-l) (m + 1) 

and 

exp(— 1) [m + 1) 

so that skewness and excess have been expressed in terms of the number m of primes contained in the product i|33fl . 
provided m is sufficiently large. The final links in our chain of arguments are then provided by two results from 
analytic number theory: Firstly, if N equals the product of the first m primes from 2 to p m , as in our case l|33fl . one 
has^ 

\nN~ Pm . (51) 
Secondly, the number ir{p m ) of primes less than p m , which is m, is asymptotically given by^ 

These estimates combine to yield 

ln(iV) 



1 (•")-!) 

ln(N) ^ 



"" ln(ln(iV)) ' ^ 53 ^ 

Thus, the final asymptotic expressions for the skewness and the excess of the distributions J7J) pertaining to large 
numbers TV expressable as a product of all primes up to some p m become 

ex P(-!) ( hi.hM.vn 
and 

72 " «P(-D(» + l) ' (55) 

Again, we check these results against exact numerical calculations: The figures and depict skewness and excess 
of the probability distributions Q corresponding to large numbers N of the type ll-i-il) , again contrasting exact data 
points with the predictions of the asymptotic formulas. For smaller N, the agreement is not quite as good as in the 
previous figures El and 01 This can be attributed to the fact that equations l|54(l and l|55|l merely stem from a leading- 
order analysis, as necessitated by the rather complicated function rj(t) underlying the iV studied here. Nonetheless, 
the approach to the Gaussian limits 71 = and 72 = is captured correctly. 
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FIG. 5: Skewness il'M for large numbers N which are the product of the first m primes. Crosses indicate exact numerical data, 
obtained with the recursion relation 1181 : the full line corresponds to the asymptotic expression 1541 . 



DISCUSSION 



We have used the correspondence between an ideal Bose gas with logarithmic single-particle levels and the number 
factorisation problem to characterise the "number-of- factors" -distribution for large integers N. The properties of 
this distribution depend on the equivalence class of N, that is, on the multiplicity of its prime factors. The case ljl5|L 
with all the prime factors being equal, constitutes one extreme; we have shown that for numbers of this type the 
skewness (|13|) and the excess <|14f) asymptotically approach the limiting values 71.00 = 12v6 ((3)/ir 3 ~ 1.1395 and 
72.00 = 2.4. We conjecture that for any number N, the skewness 71 (N) and the excess 72 (N) do not exceed the 
maximum values <|31[) and H32[) adopted by numbers of this particular type: 

Conjecture 1 For every integer N , the skewness of the probability distribution governing the number of 

factors in (x randomly selected product decomposition is bounded from above by 'yx, max — 

1.1906570491, while its 

excess \14)) ^ bounded by r y , 2. max — 

2.5119565935. 

Moreover, for a given total number of prime factors (which equals YliLi n i f° r N — 11^=1 P? 4 )) the total number 
of factorisations fl(N) becomes largest for integers N of the type H17|) . for which all prime factors differ from each 
other. Upper and lower bounds on the number of factorisations for integers of this particular type have been derived 
in appendix |0 While the lower bound clearly pertains only to the equivalence classes considered there, we conjecture 
that the upper bound inferred from the inequality l|Cll|) holds for all large integers. Again utilising the asymptotic 
relation l|53|l . we thus formulate 

Conjecture 2 For large integers N, an upper bound on the number Q(N) of possible factorisations is provided by 

In(iV) 



- ln(iV). 



ln(ln(iV)) 



(56) 
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FIG. 6: Excess Jl ip for large numbers N which are the product of the first m primes. Crosses indicate exact numerical data, 
obtained with the recursion relation 11811 : the full line corresponds to the asymptotic expression 1551 . The inset demonstrates 
that this expression actually describes the asymptotics correctly. 



In figurc0we display the bounds (|C11|) for products of different primes, together with exact numerical data. Despite 
the somewhat crude approximations, these bounds describe the data quite well. Hence, when ln(ln(iV)) 3> 1, the 
number fi(iV) grows as N at least for some integers N, which was not obvious at all from the limited data collected 
in figure n 

The actual content of conjecture [2 lies in the circumstance that it might still be possible to trade equality of prime 
factors for smallness of N: Constructing some composite integer by choosing more than one prime factor equal to 
Pi = 2, say, certainly reduces the value of O below the one that is attained when all prime factors are different, but 
also reduces the value of N itself. In this way, one might try and maintain a relatively high value of Sl(N), while 
minimising N. Indeed, the three examples singled out by the crosses in figure ^ indicate that in some cases such 
a procedure might lead to data points which fall at least close to the envelope of all pairs [N, f2(JV)]. However, we 
conjecture that the bound (|56|l holds nonetheless. 

For powers of a single prime p, the factorisation problem is equivalent to Euler's number partitioning problem 1 , so 
that the asymptotics can be deduced from the Hardy- Ramanuj an formula— for the number of partitions: 



ln[fi(iV)] ~ | log p N for TV = p rn . (57) 

As expected, in this case ln[0(AT)] lies well below the conjectured upper bound lj5U|) . On the other hand, a strict 
upper bound can be established as follows: N cannot be the product of more than log 2 (iV) primes. Assuming that 
these are all different (which can only overestimate the number of factors), and again using the result ildl|) . one finds 



ln[fi(A0] < log 2 (AT) ln[log 2 (A0] for all N . (58) 

Our conjectured bound (|56[) is clearly stronger than this "safe" one. 

Besides these number-theoretical insights made possible by the close correspondence between the microcanonical 
statistics of an ideal Bose gas and the factorisation problem, there also is a conceptual aspect of our work: For 
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10 100 1000 
m 

FIG. 7: Upper and lower bound ijCllfl for the logarithm of the number of factorisations of large integers N which are the 
products of the first m primes, in comparison with exact numerical data (crosses). It follows that an asymptotic upper bound 
on lnfi(iV) behaves at least as lniV. 

numbers l|17(l with different, i.e., distinguishable prime factors, the probability distribution Q, when normalised 
to unit variance, approaches a Gaussian for N — » oo. In contrast, if the prime factors are taken to be equal, i.e., 
indistinguishable, the distribution remains distinctly non-Gaussian even in the asymptotic limit. Thus, we encounter 
here a fairly nontrivial model for the occurrence of non-Gaussian statistics. 
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APPENDIX A: EIGENVALUES FOR A LOGARITHMIC POTENTIAL 



Let us consider the motion of a particle with mass m in a one-dimensional potential 

V(x) = V ln(^pj , (Al) 

where Vq and L are positive constants with the dimension of energy and length, respectively. Within the quasi-classical 
Bohr- Sommerf eld approximation^, the quantum mechanical energy eigenvalues e v pertaining to this potential are 
obtained by setting the classical action 

I=^fpdx (A2) 

equal to h{y+ 1/2), where p = p{x) denotes the classical momentum at the position x, and v = 0, 1, 2, ... is an integer. 
Introducing the right turning point x v corresponding to classical motion with energy e u , 

x v = Lexp^^ , (A3) 

and accounting for the symmetry of the potential, the quantisation condition becomes 



Htt ( 1 



2 \" + 2 



dx \ 2 



J 2m (e u - Vq In ^ 
\/2mVo / dx a —In — 

Jo V x » 



^2mV x v v[^) , (A4) 



giving 



2mVr 



h[v+-) . (A5) 



Utilising the relation (|A3|) for expressing the turning point through the energy, this latter equation yields the desired 
approximate eigenvalues 



V v \2L V 2mV 



^=hu> + l)+ln( ± J-—) . (A6) 



APPENDIX B: DERIVATION OF THE RECURSION RELATION 



The derivation of the recursion relation @ presented here is not restricted to logarithmic energy levels and thus 
generalises the derivation previously given in ref^£. 

As in section|n] let Tk(N) be the number of possible factorisations of N into k or less natural numbers larger than 1. 
If there are less than k factors, say k — m, we multiply the product by l m . (This emphasises the correspondence with 
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the Bose condensate: The factor l m represents m particles which reside in the ground state, carrying no energy.) For 
example, 1^(12) gives rise to the four factorisations 

12 = 2 • 2 • 3 • 1 
= 2-6-1-1 
= 3-4-1- 1 

= 12 - 1 • 1 • 1 . (Bl) 

Keeping both N and k fixed, and randomly selecting one of the possible factorisations, the probability for the factor v 
to occur at least n times is given by 

r r fc _„(A7Q . N , n _ „ 
P*(n) = \ W ■ nmoaiy " u . (B2) 
I : else 

The probability to find the factor v exactly n times is then obtained as a difference, 

P„(n) = P£(n)-P£(n + l). (B3) 

Next, let # v be the number of occurrences of the factor v in some product. Taking the average over all possible 
products, we obtain 



71=1 



n=l 



^ E r k - n (N/^)- (B4) 



1=1 

N mod un=Q 



Since there is no possibility to factorise N > 1 such that zero factors are larger than 1, we have Tq(N > 1) = 0; since, 
however, there trivially is such a possibility for N = 1, it follows that r (l) = 1. 
By definition of we have for every factorisation 



N 



fc = (B5) 



which also holds upon averaging (k — Ylv=i )• Hence, 



fc = Ef^y E r fc _ n (jv/o. (B6) 



a = l 
JV mod i/™=0 



leading immediately to 

N k 



JV mod m^^O 



Since the sums are finite, we can safely exchange the order of summation and arrive at the recursion relation 10 • 

APPENDIX C: ASYMPTOTICS FOR PRODUCTS OF DISTINCT PRIMES 

With an obvious simplification of notation, the recursion relation (|18fl for the number of ways to decompose a 
product of m distinct primes into exactly k integer factors takes the form 

$m,fe = fc$ m _i )fe + $ m _i )fe _i ; $ m ,i = l, ® m ,k>m=0- (CI) 
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We will first show by induction over k that the solution to this relation is given by 



(-1) 



*™,* = ^>:(-i)TJ' m - (C2) 



For k = 1, equation l|C2(l obviously is correct. Moreover, we have 



k\ 



£ ( -i)^W-£(-r/fc 

i=i w i=i 



Using 



this yields 



k-l\ k-t(k 
£ 



(C3) 



(C4) 



fe$m-l,& + $m-l,fc-l 

(-i) fc 

fc! 

(-l) fe 



E(-i)^(^ m - 1 -E( 

£=i ^ ' e=i 

((-i^Q^+D- 1 )^! 



which proves the assertion (|C2|) . 

This result can now be employed to estimate the total number fi(Oi=iPi) °f factorisations for numbers i|17|) 
containing no identical prime factors: 

(m \ m 
Yi pi ) = E $m < fc 
i=l / k=l 

2^2-^ (k-ey. 7T 

fc=l £=1 v ' 
to /to- 



EE 



(-1)" \ r 



i/! / £! 

£=i Vi/=o / 

_ ^ r(m-l + l,-l) l m 
- e (m-*)! 7T ' 

where T(a, 6) denotes the incomplete Gamma function. The last equality here is proven by induction, using the 
recursive definition of T(m, —1): 

r(m+ 1,-1) = (-l) m e + mr(m,-l) , r(l,-l) = e. (C7) 

For large m, the sum (|C6(1 is dominated by terms with m — £ + 1 1, so that we may use the asymptotic relation 
T(m - £ + 1, -1) = t m - £ e -* dt - T(m - I + 1). This leads to the asymptotic equality 

/ to \ m „ m 

°(n» ~ eV 



m+1 

dx exp[/(x)] , (C8) 



where, according to Stirling's formula, 

f(x) ^ mhix ~ xlnx + x 
The saddle-point approximation to this latter integral l|C8t then gives 

r 



« n 



Pi 



27r exp 



— In to — 



resulting in the asymptotic bounds 



m In m — to < In 



In to 



< to In j 



